Vez£l 



1 to 3640 ^ (J 

fU CGGGGGGAGTGGGGAGGAGGGGC^^PrCGCCGCAGCCATGGAGGCCAACTCWACCGCGTTC 

|fl ^ MEAK'WTAF l f q a h e a 



kTCCCAT 
S H 17 



ATTACTCAGAAACCTCAGGCTGCACCAGAAACATTAAAGGATGCCATTGGGATTAAAAAAGAAAAACCCAA 270 
ITQKPQAAPETLKDAIGIKKEKPKTSFV C T 77 

TACTGCAGTAAAGCATTCAGGGACAGCTATCACCTGAGGCGCCATCAGTC^ 560 

XCSRAFRPSYBfrRRBpgCH tgiklvsrakki07 

ACCCCCACCACGGTGGTTCCCCTTATCTCCACCATTGC^^ 450 
TPTTVVPLISTIAGDSSRTSLVSTIAGILS137 

ACAGTCACTACATCTTCCTCGGGCACCAACCCCAGCAGCAGCGCTAGTACCACAGCAA 540 
TVTTS S SGTNPSS SASTTAMPVPQS VKKPS 167 

AAGCCTGTCAAGAAGAACCACGCCTGTGAGATGTGTGGGAAGGCCTTCCGGGATGTGTACCACCT 630 
KPVKKMHA CEMCGKAFRDVY HLMRHKL3H S 197 

GACGAAAAGCCCTTTGAGTGTCCTATTTGTAATCAGCGCTTCAAG 720 
D E K P F E CP ICNORFKRKD RMTYffVRfl H EGG 227 

ATCACCAAACCCTATACTTGCAGTGTTTGTGGGAAAGGCTTCTC^ 810 
1 T K P Y T C3VCGKGF3RP OHLSCHVKHVH S T 257 

GAAAGACCCTTCAAATGCCAAACGTGCACTGCTGCCTTTGCCACCAAAGA 900 
E R P F K CQTCTAAFATKDRLRT HMVRg E G K V 287 

U, TCATGTAACATCTGTGGGAAGCTCTTGAGTGCAGCATATATCAC 990 

— S CNICGK&LSAAYITSHLKTH GQSQSINCN 317 

f~ m ACGTGCAAACAAGGCATCAGCAAAACGTGCATGAGTGAGGAGACCAGCAATC^ 1080 

TCKQGISKTCMSEETSMQKQQQQQQQQQQQ 347 

yi ************ 

5 ; CAACIAACAACAACATGTGACAAGCTGGCCAGGGAAGCAGGTAGAGACACTGAGACTGTGGGAAGAAGCT 1170 

6 QQQQHVTSWPGKQVETI»RI»WEEAVKARKKE 377 
Hi • • * • 

! GCtGCCAACCTGlWCAAACCTCCACGGCTGCTACGACACCAGTCACTCTCA 1260 
%S AAMLCQTSTAATTPV TLTTPFHITS3VSSG407 

ACTATGTCAAACCCAGTCACAGTGGCAGCTGCAATGAGCATGAGAAGTCCA^ 1350 
_ TMSNPVTVAAAMSHRSPVNVSSAVNITSPI* 437 

Q GCCATGACCTCACCTTTAACACTCACCACCCCAGTCAACCTCCCCACC^ 1440 

AMTSPLTLTTPVNLPTPVTAPVNIAH PVT1 467 



TSPMNLPTPMTLAAPLNIAMRPVE SMPFLP 497 



1530 



H 

—-. CAAGCTTTGCCTACGTCACCGCCTTGGTAAACAGTATT^^ 162Q 
W QALPTSPPW 506 

n| 

GATrAAAGCAAAAAGCAGACTATGAAATTCGGAGGTTTTATTATGTTAGTTAATAA 1110 

AAAGTAGGGTATATGTOTAACTTATCACTGGACCACTTTAGTTtACTC^ 1800 

GCTGGCAAGACGAAATGCCAGAATTAAAACCAATCATAAAACCCATTTCAAAAT 1890 

GAGAGCTAGTGTAGCATTGTCTGTGGTTTTGCTCGTATGAGTC 2070 

TACCTTTGGGAATATGTTAATTTCAGGCAGCATTCCCTAT 2160 

TCTAACTTCAACATATAATAGGGATTGTGACCTGATATTTGGAGA^ 2250 

CTAGTTGACTTTTTAAAAAAAAAAAAACAAAAAAAAAAAAAAGGAATTCCGAG^ 2340 

GGTGGCTTCAGAACCCAGGAAGTGGCCAAGGGGCACAGACTCTGCTGGAGGCCI^ 2430 

TTTGCCTTAGGCCACAAAAAGAAGAAGGCTACCCCACTTACAGATGCAGACCA 2520 

AGTGTTGGCAGCAGATGGTACTACTGAGCATGTCTACAGACCCAG 2610 

GACCTTTGTGCAGCGTGTATACCAGCCTTACCTCACCACTT^ 2700 

TGGAGCAGCAATATGCCAGCCTCCATGTGGGAATGGAGGGAGTTCATCCG^ 2880 

CTTGCCAGACAGATGTTGATGAATGCAGTACAGGAGAGGCCAGTTGTCCCCAGCGCTGTGTC 2970 

GATGGGAGGGACAAAGCCCATCnXAGATGGGACCCCCTTC 3060 
TGGACAGCATGGCGAGAGAGGAGGTGTACAGGCTGCAGGCTCGGGTTGATGTGCT^ . 3150 

AGTCCCACAAGACACCACTCCACCCACAGAGAGCCTAGGGGACCCATGGGGTGGACACCAGGGCTC 3330 

GGGCAGATTTGCAAGTTTACACCTTTTTTCCTCTCCTG^ 3420 

TAGAGCAGTATCCAGATCCCTTGTAGCCAGAGTTCAGGGACGCTGTCTGGTGGTGCCTATGAGCAGAAGCCCTC 3510 

TGTCTCCATCTTTGTCATAATAAAGCTGAGACTTGAAAAAAAAAA 3645 



Human DB1 DNA and Protein S q nces: 

10 20 30 40 50 60 

AGCGGGGGGAGTGGGGAGGAGGGGGGTCGGCCGCCGCAGCCATGGAGGCCAACTGGACCG 

M E A N W T> 

70 80 90 100 110 120 

CGTTCCTGTTCCAGGCCCATGAAGCTTCCCATCACCAACAGCAGGCAGCACAGAACAGCT 
AFLFQAHEASHHQQQAAQNS> 

130 140 150 160 170 180 

TGCTGCCCCTCCTGAGCTCTGCCGTGGAGCCCCCTGATCAGAAACCATTGCTTCCAATAC 
LLPLLSSAVEPPDQKPLLPI> 

190 200 210 220 230 240 

CAATAACTCAGAAACCTCAGGGTGCACCAGAAACATTAAAGGATGCCATTGGGATTAAAA 
PITQKPQGAPETLKDAIGI K> 

250 260 270 280 290 300 

AAGAAAAACCCAAAACTTCATTTGTGTGCACTTACTGCAGTAAAGCTTTCAGGGACAGCT 
KEKPKTSFVCTYCSKAFRDS> 

310 320 330 340 350 360 

ATCACCTGAGGCGCCACGAATCCTGCCACACAGGGATCAAGTTGGTGTCCCGGCCAAAGA 
YHLRRHESCHTGIKLVSRP K> 

370 380 390 400 410 . 420 

AAACCCCCACCACGGTGGTTCCCCTTATCTCTACCATCGCTGGGGACAGCAGCCGAACTT 
KTPTTVVPLISTIAGDSSR T> 

430 440 450 460 470 480 

CGTTGGTCTCGACCATTGCAGGCATCTTGTCAACAGTCACTACATCTTCCTCGGGCACCA 
SLVSTIAGILSTVTTSSSGT> 

490 500 510 520 530 540 

ACCCCAGTAGCAGTGCCAGCACCACAGCTATGCCAGTGACCCAGTCTGTCAAGAAACCCA 
NPSSSASTTAMPVTQSVKKP> 

550 560 570 580 590 600 

GTAAGCCTGTCAAGAAGAACCATGCTTGTGAGATGTGTGGGAAGGCCTTCCGAGATGTGT 
SKPVKKNHACEMCGKAFRD V> 

610 620 630 640 650 660 

ACCATCTCAATCGACACAAGCTCTCCCATTCAGATGAGAAACCCTTTGAGTGTCCTATTT 
YHLNRHKLSHSDEKPFECPI> 

670 680 690 700 710 720 

GTAATCAGCGCTTCAAGAGGAAGGACCGGATGACTTACCATGTGAGGTCTCATGAAGGAG 
CNQRFKRKDRMTYHVRSHE G> 

730 740 750 760 770 780 

GCATCACCAAACCCTATACTTGCAGTGTTTGTGGGAAAGGCTTCTCAAGGCCTGACCACT 
GITKPYTCSVCGKGFSRPD H> 
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iff) 

FtfrvlLG ± Cco/ur ) 

790 800 810 820 830 840 

TAAGCTGTCATGTAAAACATGTCCATTCAACAGAAAGACCCTTCAAATGCCAAACGTGCA 
LSCHVKHVHSTERPFKCQTO 

850 860 870 880 890 900 

CTGCTGCCTTTGCCACCAAAGACAGACTGCGGACACACATGGTGCGCCATGAAGGCAAGG 
TAAFATKDRLRTHMVRHEG K> 

910 920 930 940 950 960 

TATCATGTAACATCTGTGGGAAGCTCCTGAGTGCAGCATACATCACCAGCCACTTAAAGA 
VSCNICGKLLSAAYITSHL K> 

970 980 990 1000 1010 1020 

CTCATGGGCAGAGCCAAAGTATCAACTGTAATACATGTAAACAAGGCATCAGTAAAACAT 
THGQSQS INCNTCKQ GISKT> 

1030 1040 1050 1060 1070 1080 

GCATGAGTGAAGAGACCAGTAACCAAAAGCAGCAGCAGCAGCAGCAGCAGCAACAACAAC 
CMS EETSNQKQQQ.QQQQQQQ> 

1090 1100 1110 1120 1130 1140 

AACAACAACATGTGACAAGCTGGCCAGGGAAGCAAGTAGAAACACTCAGACTGTGGGAAG 
QQQHVTSWPGKQVETLRLWE> 

1150 1160 1170 1180 1190 1200 

AAGCTGTTAAAGCAAGGAAGAAAGAAGCTGCTAACCTGTGCCAAACCTCCACGGCTGCTA 
EAVKARKKEAANLCQTSTA A> 

1210 1220 1230 1240 1250 1260 

CGACACCTGTGACTCTCACTACTCCATTCAGTATAACATCCTCTGTGTCGTCTGAGACTA 
TTPVTLTTPFSITSSVSSET> 

1270 1280 1290 1300 1310 1320 

TGTCAAACCCAGTCACAGTGGCAGCTGCAATGAGCATGAGAAGTCCAGTAAATGTTTCAA 
MSNPVTVAAAMSMRS P V N V S> 

1330 1340 1350 1360 1370 1380 

GTGCAGTTAACATAACCAGCCCAATGAACATAGGGCATCCTGTAACTATAACCAGTCCAT 
SAVNI TS PMNIGH PVT I t S P> 

1390 1400 1410 1420 1430 1440 

TATCCATGACCTCTCCTTTAACACTCACTACCCCAGTCAACCTCCCCACCCCCGTCACTG 
LSMTSPLTLTTPVNLPTPV T> 

1450 1460 1470 1480 1490 1500 

CCCCAGTGAATATAGCACACCCTGTCACCATCACATCTCCAATGAATCTACCCACACCTA 
APVNIAHPVTITSPMNLPTP> 



1510 1520 1530 1540 1550 1560 

TGACATTAGCCGCCCCTCTCAATATAGCAATGAGACCTGTAGAGAGCATGCCTTTCTTGC 
MTLAAPLNIAMRPVESMPFL> 



of^uo Vc ^^J 

1570 1580 1590 1600 1610 1620 

CCCAAGCTTTGCCTACATCACCGCCTTGGTAAACAGTATTATAAAATCAAAATATGGGTA 
PQALPTSPPW*> 

1630 1640 1650 1660 1670 1680 

AAAGTAAATATTTACCAGCAACTTAACTTTTAGTTGATTAAAGCAAAAAGTAAACCATGA 

1690 1700 1710 1720 1730 1740 

AATTGGGAGATTTTATTACATTAGTTAATAAGAGTGTGGTAGCATTTTTCTCCAATTTGG 

1750 1760 1770 1780 1790 1800 

CTGGGATTATTCAAAGTAGGGTGTGTATGTAACTTATCACTGGACCACTTTAGTTTAATC 

1810 1820 1830 1840 1850 1860 

AGAAATTCCTTTTAGCTGACAACATTGCTTAAACAGGATAGTAGTTGGCAAGATGAAATG 

1870 1880 1890 1900 1910 1920 

CCAGAATTAAAACCAATCATAAGTAGAACCCACTTCAAAATAAAAAAACAGCATTACTAT 

1930 1940 1950 1960 1970 1980 

TTCTAATCCCAAGGAATCACTTTATTGTAAACACTAGCAGAACTCTTCTCCCTATACAAG 

1990 2000 2010 2020 2030 2040 

GTGGATGGCTGATTTTAACCTGAAATTTTAAATCCACAGATTGAGAGCTAGTGTAGAATT 

2050 2060 2070 2080 2090 2100 

GTCTGTGTTTATTGTTTTTATGAGTAAATACATGCATTGTCATAATAAAATGCATTTCAG 

2110 2120 2130 2140 2150 2160 

AGAATATGCATTTTACCTTTGGGAATATGTTAATTTCAGGCAGCATTCCCTATGGGAAAG 

2170 2180 2190 2200 2210 2220 

GTGATACCAGCTCTGATATGCAAAGCATATGATAATTTATCATTCTAACTTCAACGTATA 

2230 2240 2250 2260 2270 2280 

ATAGGGATTGTGACCTGATATTTGGAGATGTAAATATTGCTCAGCATATTAATCCCGATG 



2290 2300 
GAATATAGCATTGTAGTTGACTTTTT 
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A. R troviral 
Entrapment 
V ctor 



XZ 



EcoRI 



Iltr 







AP 



supF 



neo 



— LTR 



B. Infection of ES cells 
Isolation of G418 r clones 



C. Screen for AP expression 
in embryofd bodies 




supF 



C5 (?) C?> J 
Undifferentiated 
ES cells 



Embryoid bodies 

D. Selection of ES cell clones with 
differential AP expression in vitro 



Terminally differ ntiat 
embryoid bodi s 



E. Aggregation chimeras 
Transfer into pseudo- 
pregnant foster females 




. Cloning of 
flanking cellular DNA 
by supF complem ntation 
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A: Jackson BSS Chromosome 2 



D2Mit6 

Abi1 

Pax8 

Vezfl 

Abf 

Psmb7 
Acvr2a 



psii 



odHflfl 

4534 2 2 1 1 1 1 4 3 



R SB 

2.13 1.49 

2.13 1.49 

2.13 1.49 

1.06 1.06 

1.06 1.06 

7.45 2.71 



B: Jackson BSS Chromosome 2 



- - D2Mit1, D2Bir2 

- - D2MH4, D2MH31 

- - Jund2, Mpmv3 
-f- D2Mit178, D2MK464 

D2Mit6 

Abi1 

-f- D2Bir5, Pax8 

- - Lhx3, Notch 1, Vezfl 

- - Abf, Eng. Pbx3 
Psmb7 

5cM 



- - Acvr2a, D2Mit155 
H- CfeJws, D2MH8 
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Restriction Enzyme Map of a 20 kb Genomic DNA of the Vezfl Gene 
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Xh 



4 



1 5 kb Ea 3 kb 

BamHI (B), EcoRI (E), EcoRV (E5), Eagl (Ea), Notl (N), Patl (P), Sad (S), 
Xbal (Xb), and Xhol (Xh). 

— Intronic sequence; 
pTlExon 1 
[F] Exon 2 



VezfVEXPRESSION VECTORS 

CMV/ VezM cDNA IRES AP SV40 
CMV-V Zf1 ,Tie2; ; , intron+p(A) 

Ti 2-v zfi k^^M I Hill T^foi'iiliiii?; 



CMV-AZF-Vezf1 30 9 9 92 

Ti 2-AZF-Vezf1 II K^ijijijiiiij 5 ;! 

CMV-Vezf1-ZF , 309 992 

Ti 2-Vezf1-ZF liW^M I I Jv^;:;:;:;:.;.;! 



CMV-AP 
Ti 2-AP 
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